Bayesian context clustering using cross valid prior distribution for HMM-based speech recognition
نویسندگان
چکیده
Decision tree based context clustering [Young; '94] ・ Construct a parameter tying structure ・ Can estimate robust parameter ・ Can generate unseen context dependent models ・ Minimum description length (MDL) criterion [Shinoda; '97] Bayesian approach ・ Variational Bayesian (VB) method [Attias; '99] ⇒ Applied to speech recognition [Watanabe; '04] ・ Can use prior information ⇒ Affect context clustering Problems Prior information is not generally given ⇒ Prior distribution becomes tuning parameters
منابع مشابه
Improved Bayesian Training for Context-Dependent Modeling in Continuous Persian Speech Recognition
Context-dependent modeling is a widely used technique for better phone modeling in continuous speech recognition. While different types of context-dependent models have been used, triphones have been known as the most effective ones. In this paper, a Maximum a Posteriori (MAP) estimation approach has been used to estimate the parameters of the untied triphone model set used in data-driven clust...
متن کاملContinuous Speech Recognition Based on General Factor Dependent Acoustic Models
This paper describes continuous speech recognition incorporating the additional complement information, e.g., voice characteristics, speaking styles, linguistic information and noise environment, into HMM-based acoustic modeling. In speech recognition systems, contextdependent HMMs, i.e., triphone, and the tree-based context clustering have commonly been used. Several attempts to utilize not on...
متن کاملUsing Hybrid HMM/BN Acoustic Models: Design and Implementation Issues
In recent years, the number of studies investigating new directions in speech modeling that goes beyond the conventional HMM has increased considerably. One promising approach is to use Bayesian Networks (BN) as speech models. Full recognition systems based on Dynamic BN as well as acoustic models using BN have been proposed lately. Our group at ATR has been developing a hybrid HMM/BN model, wh...
متن کاملSpeaker adaptation in noisy environments based on parameter estimation using uncertain data
This paper describes new method for the speaker adaptation of HMM parameters in environments with background noise. This method is based on Bayesian estimation, and calculates the a posteriori distribution of cleanspeech HMM parameters from their a priori distribution by using noisy speech observations. The advantage of the method is that the distribution of the noise can be taken into account ...
متن کاملImproving Phoneme Sequence Recognition using Phoneme Duration Information in DNN-HSMM
Improving phoneme recognition has attracted the attention of many researchers due to its applications in various fields of speech processing. Recent research achievements show that using deep neural network (DNN) in speech recognition systems significantly improves the performance of these systems. There are two phases in DNN-based phoneme recognition systems including training and testing. Mos...
متن کامل